Human HSET motor proteins and methods for their use

ABSTRACT

The present invention provides high throughput screening systems for identifying compounds useful in the treatment of cellular proliferation disorders. The method can be performed in plurality simultaneously with fluorescence or absorbance readouts.

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a continuation-in-part application of U.S. Ser. No. 09/295,612 filed Apr. 20, 1999, which is incorporated herein by reference.

FIELD OF THE INVENTION

The invention relates to methods for the identification of compounds that modulate the activity of target proteins having motor domains and use of such methods for the identification of therapeutic agents.

BACKGROUND OF THE INVENTION

The kinesin superfamily is an extended family of related microtubule motor proteins. It can be classified into at least 8 subfamilies based on primary amino acid sequence, domain structure, velocity of movement, and cellular function. This family is exemplified by “true” kinesin, which was first isolated from the axoplasm of squid, where it is believed to play a role in anterograde axonal transport of vesicles and organelles (see, e.g., Goldstein, Annu. Rev. Genet. 27:319-351 (1993)).

Mitotic kinesins are enzymes essential for assembly and function of the mitotic spindle, but are not generally part of other microtubule structures. Mitotic kinesins play essential roles during all phases of mitosis. These enzymes are “molecular motors” that translate energy released by hydrolysis of ATP into mechanical force which drives the directional movement of cellular cargoes along microtubules. The catalytic domain sufficient for this task is a compact structure of approximately 340 amino acids. During mitosis, kinesins organize microtubules into the bipolar structure that is the mitotic spindle. Kinesins mediate movement of chromosomes along spindle microtubules, as well as structural changes in the mitotic spindle associated with specific phases of mitosis. Experimental perturbation of mitotic kinesin function causes malformation or dysfunction of the mitotic spindle, frequently resulting in cell cycle arrest.

Within this functional group of kinesins resides a group of kinesins from several organisms that share significant sequence homology, the KAR3 family of minus end-directed motor proteins. These include, but are not limited to, HSET (the human homologue of the KAR3 family); Drosophila melanogaster nonclaret disjunctional (“Dmncd”); C. elegans Klp-3; MmKifC1; X1XCTKS, AtKatA, AtKatB, AtKatC, AnKLPA, SpoKLP2, DdKRPK2, SpoKLP1, ScKAR3, CgCHO2, and the like.

One of the best studied members of this family is ncd. Ncd mutation leads to spindles with splayed poles that are frequently split into multiple distinct foci, and spurs of microtubules have been observed to project from the main body of these spindles. This motor and its homologues are believed to contribute to both the overall structural integrity of the spindle and the efficiency of spindle formation by focusing microtubule minus ends.

HSET has been shown to localize between microtubules in the metaphase spindle of human cells. It has also been shown that HSET is essential to establish cohesive poles in mouse meiotic spindles. HSET is believed to act antagonistically to the plus end-directed activity of KSP, both in vitro and in vivo. These two motor proteins, through cross-linking and oppositely oriented motor activity, are thought to generate a well-ordered framework of microtubule bundles within the spindle. This cross-linking activity is important for the overall structural stability of the spindle lattice. Thus, the kinesin HSET plays an important role in the mitotic process. See, e.g., Mountain et al. (1999) J. Cell Biol. 147:351; Sawin and Endow (1993) Bioessays, 15:399; Khan et al. (1997) J. Mol. Biol. 270:627; Nakagawa et al. (1997) Proc. Natl. Acad. Sci. USA 94:9654; and Hirokawa et al. (1998) Science 279:519.

Defects in function of HSET could be expected to result in cell cycle arrest in mitosis. As such, compounds that modulate the activity of this kinesin may affect cellular proliferation. The present invention provides a novel method to identify such compounds.

SUMMARY OF THE INVENTION

The present invention provides methods to identify candidate agents that bind to a target protein or act as a modulator of the binding characteristics or biological activity of a target protein. In one embodiment, the method is performed in plurality simultaneously. For example, the method can be performed at the same time on multiple assay mixtures in a multi-well screening plate. Furthermore, in a preferred embodiment, fluorescence or absorbance readouts are utilized to determine activity. Thus, in one aspect, the invention provides a high throughput screening system for detecting modulators of activity of a target protein.

In one embodiment, the present invention provides a method of identifying a candidate agent as a modulator of the activity of a target protein. The method comprises adding a candidate agent to a mixture comprising a target protein which directly or indirectly produces ADP or phosphate, under conditions that normally allow the production of ADP or phosphate. The method further comprises subjecting the mixture to a reaction that uses said ADP or phosphate as a substrate under conditions that normally allow the ADP or phosphate to be utilized and determining the level of activity of the reaction as a measure of the concentration of ADP or phosphate. A change in the level between the presence and absence of the candidate agent indicates a modulator of the target protein.

The phrase “use ADP or phosphate” means that the ADP or phosphate are directly acted upon by detection reagents. In one case, the ADP, for example, can be hydrolyzed or can be phosphorylated. As another example, the phosphate can be added to another compound. As used herein, in each of these cases, ADP or phosphate is acting as a substrate.

Preferably, the target protein either directly or indirectly produces ADP or phosphate and comprises a motor domain. More preferably, the target protein comprises HSET or a fragment thereof. Most preferably, the target protein comprises SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6.

Also provided are modulators of the target protein including agents for the treatment of cellular proliferation, including cancer, hyperplasias, restenosis, cardiac hypertrophy, immune disorders and inflammation. The agents and compositions provided herein can be used in a variety of applications which include the formulation of sprays, powders, and other compositions. Also provided herein are methods of treating cellular proliferation disorders such as cancer, hyperplasias, restenosis, cardiac hypertrophy, immune disorders and inflammation, for treating disorders associated with HSET activity, and for inhibiting HSET.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an embodiment of a nucleic acid sequence encoding a particularly preferred target protein (SEQ ID NO:1) wherein the start and stop codons are framed.

FIG. 2 shows an embodiment of a particularly preferred target protein (SEQ ID NO:2). The construct contains residues 151 through 510 of the full length HSET enzyme.

FIG. 3 shows an embodiment of a nucleic acid sequence encoding a particularly preferred target protein (SEQ ID NO:3) wherein the start and stop codons are framed.

FIG. 4 shows an embodiment of another particularly preferred target protein (SEQ ID NO:4). The construct contains residues 151 through 519 of the full length HSET enzyme.

FIG. 5 shows an embodiment of a nucleic acid sequence encoding a particularly preferred target protein (SEQ ID NO:5) wherein the start and stop codons are framed.

FIG. 6 shows an embodiment of another particularly preferred target protein (SEQ ID NO:6). The construct contains residues 152 through 519 of the full length HSET enzyme.

DETAILED DESCRIPTION OF THE INVENTION

I. Definitions

“ADP” refers to adenosine diphosphate and also includes ADP analogs, including, but not limited to, deoxyadenosine diphosphate (dADP) and adenosine analogs.

“Biologically active” target protein refers to a target protein that has one or more of the biological activities of a kinesin protein, including, but not limited to, microtubule stimulated ATPase activity, as tested, e.g., in an ATPase assay. Biological activity can also be demonstrated in a microtubule gliding assay or a microtubule binding assay. “ATPase activity” refers to the ability to hydrolyze ATP. Other activities include polymerization/depolymerization (effects on microtubule dynamics), binding to other proteins of the spindle, binding to proteins involved in cell-cycle control, or serving as a substrate to other enzymes, such as kinases or proteases, and specific kinesin cellular activities, such as chromosome congression, axonal transport, etc.

“Biological sample” as used herein is a sample of biological tissue or fluid that contains a target protein or a fragment thereof or nucleic acid encoding a target protein or a fragment thereof. Biological samples may also include sections of tissues such as frozen sections taken for histological purposes. A biological sample comprises at least one cell, preferably plant or vertebrate. Embodiments include cells obtained from a eukaryotic organism, preferably eukaryotes such as fungi, plants, insects, protozoa, birds, fish, and reptiles, more preferably a mammal such as rat, mouse, cow, dog, guinea pig, or rabbit, and most preferably a primate such as chimpanzee or human.

A “comparison window” includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 25 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the global alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity methods of Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988) and Altschul et al. Nucleic Acids Res. 25(17): 3389-3402 (1997), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and BLAST in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by manual alignment and visual inspection (see, e.g., Ausubel et al., supra).

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments. It can also plot a dendrogram showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 25:351-360 (1987). The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153 (1989). As a general rule, PILEUP can align up to 500 sequences, with any single sequence in the final alignment restricted to a maximum length of 7,000 characters.

The multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster can then be aligned to the next most related sequence or cluster of aligned sequences. Two clusters of sequences can be aligned by a simple extension of the pairwise alignment of two individual sequences. A series of such pairwise alignments that includes increasingly dissimilar sequences and clusters of sequences at each iteration produces the final alignment.
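The pairwise step on which the progressive procedure relies can be illustrated with a minimal sketch of a global alignment in the style of Needleman & Wunsch. This is not the PILEUP implementation; the function name and the match, mismatch, and gap scores are illustrative assumptions chosen only to keep the example short.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align sequences a and b.

    Returns (score, aligned_a, aligned_b). The scoring values are
    hypothetical defaults, not parameters taken from the specification.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

In a progressive scheme, the same dynamic-programming recurrence is extended so that a column of an existing cluster, rather than a single residue, is scored at each step.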

“Variant” applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCT all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are “silent variations,” which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each degenerate codon in a nucleic acid can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in each described sequence.
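As an illustration of silent variation, the following sketch enumerates the synonymous codons of a given codon and all silent variants of a short coding sequence under the standard genetic code. The function names are hypothetical, and the example sequence is arbitrary rather than taken from any SEQ ID NO.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def synonymous_codons(codon):
    """Return all codons encoding the same amino acid as `codon`, sorted."""
    aa = CODON_TABLE[codon.upper()]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def translate(seq):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def silent_variants(seq):
    """Yield every nucleic acid sequence encoding the same polypeptide as seq."""
    seq = seq.upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    for combo in product(*(synonymous_codons(c) for c in codons)):
        yield "".join(combo)
```

For example, `synonymous_codons("GCA")` returns the four alanine codons named above, and every sequence yielded by `silent_variants("ATGGCA")` translates to the same two-residue peptide. The number of variants grows multiplicatively with sequence length, so full enumeration is practical only for short sequences.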

Also included within the definition of target proteins of the present invention are amino acid sequence variants of wild-type target proteins. These variants fall into one or more of three classes: substitutional, insertional or deletional variants. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the target protein, using cassette or PCR mutagenesis or other techniques well known in the art, to produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture. Variant target protein fragments having up to about 100-150 amino acid residues may be prepared by in vitro synthesis using established techniques. Amino acid sequence variants are characterized by the predetermined nature of the variation, a feature that sets them apart from naturally occurring allelic or interspecies variation of the target protein amino acid sequence. The variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, although variants can also be selected which have modified characteristics.

Amino acid substitutions are typically of single residues; insertions usually will be on the order of from about 1 to about 20 amino acids, although considerably longer insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in some cases, deletions may be much longer.

Substitutions, deletions, and insertions or any combinations thereof may be used to arrive at a final derivative. Generally, these changes are done on a few amino acids to minimize the alteration of the molecule. However, larger changes may be tolerated in certain circumstances.

The following six groups each contain amino acids that are conservative substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

(see, e.g., Creighton, Proteins (1984)).
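The six groups above can be encoded directly, as in the following minimal sketch, to test whether a given single-residue substitution is conservative in the sense used herein. The function name is hypothetical, and treating an unchanged residue as "not a substitution" is an assumption of the sketch.

```python
# The six conservative substitution groups listed above, in one-letter code.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative_substitution(original, replacement):
    """Return True if replacing `original` with `replacement` stays in one group.

    Identical residues are not counted as a substitution and return False.
    Residues absent from every group (e.g., G, P, C, H) never qualify.
    """
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return False
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS
    )
```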

“Cytoskeletal component” denotes any molecule that is found in association with the cellular cytoskeleton, that plays a role in maintaining or regulating the structural integrity of the cytoskeleton, or that mediates or regulates motile events mediated by the cytoskeleton. The term includes cytoskeletal polymers (e.g., actin filaments, microtubules, intermediate filaments, myosin fragments), molecular motors (e.g., kinesins, myosins, dyneins), cytoskeleton associated regulatory proteins (e.g., tropomyosin, alpha-actinin) and cytoskeletal associated binding proteins (e.g., microtubule associated proteins, actin binding proteins).

“Cytoskeletal function” refers to biological roles of the cytoskeleton, including but not limited to the providing of structural organization (e.g., microvilli, mitotic spindle) and the mediation of motile events within the cell (e.g., muscle contraction, mitotic chromosome movements, contractile ring formation and function, pseudopodal movement, active cell surface deformations, vesicle formation and translocation).

A “diagnostic” as used herein is a compound, method, system, or device that assists in the identification and characterization of a health or disease state. The diagnostic can be used in standard assays as is known in the art.

An “expression vector” is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be transcribed operably linked to a promoter.

“High stringency conditions” may be identified by those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50° C.; (2) employ during hybridization a denaturing agent such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C.; or (3) employ 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC (sodium chloride/sodium citrate) and 50% formamide at 55° C., followed by a high-stringency wash consisting of 0.1×SSC containing EDTA at 55° C.

“High throughput screening” as used herein refers to an assay which provides for multiple candidate agents or samples to be screened simultaneously. As further described below, examples of such assays may include the use of microtiter plates which are especially convenient because a large number of assays can be carried out simultaneously, using small amounts of reagents and samples.

By “host cell” is meant a cell that contains an expression vector and supports the replication or expression of the expression vector. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells such as CHO, HeLa and the like, or plant cells. Both primary cells and cultured cell lines are included in this definition.

The phrase “hybridizing specifically to” refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (T_(m)) for the specific sequence at a defined ionic strength and pH. The T_(m) is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equilibrium. Typically, stringent conditions will be those in which the salt concentration is less than about 1.0 M sodium ion, typically about 0.05 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
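The rule of selecting a temperature about 5° C. below the T_(m) can be sketched as follows. The specification does not state how T_(m) is obtained; the sketch assumes the Wallace rule (2° C. per A or T, 4° C. per G or C), a rough approximation applicable only to short oligonucleotide probes, and the function names are hypothetical.

```python
def wallace_tm(probe):
    """Estimate Tm in degrees C for a short DNA probe using the Wallace rule.

    Assumption: this rule of thumb is a stand-in; it ignores ionic strength,
    pH, and probe concentration, all of which affect the true Tm.
    """
    s = probe.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2.0 * at + 4.0 * gc


def stringent_temperature(tm_celsius, offset=5.0):
    """Return a hybridization temperature `offset` degrees C below the Tm."""
    return tm_celsius - offset
```

For a measured or otherwise calculated T_(m), only `stringent_temperature` is needed; the estimate from `wallace_tm` should be replaced whenever a better value is available.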

The terms “identical” or percent “identity”, in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence over a comparison window, as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Preferably, the percent identity exists over a region of the sequence that is at least about 25 amino acids in length, more preferably over a region that is 50 or 100 amino acids in length. This definition also refers to the complement of a test sequence, provided that the test sequence has a designated or substantial identity to a reference sequence. Preferably, the percent identity exists over a region of the sequence that is at least about 25 nucleotides in length, more preferably over a region that is 50 or 100 nucleotides in length.
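A minimal sketch of a percent identity calculation over two already-aligned sequences is given below. Several conventions exist for the denominator; this sketch assumes that columns containing a gap in either sequence are excluded, and the function names and the default minimum window of 25 positions (taken from the preferred region length above) are illustrative.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over ungapped columns of an alignment.

    Assumption: columns with a gap in either sequence are not counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


def identity_over_window(aligned_a, aligned_b, min_length=25, gap="-"):
    """Return percent identity, or None if fewer than min_length columns compare."""
    compared = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap
    )
    if compared < min_length:
        return None
    return percent_identity(aligned_a, aligned_b, gap)
```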

When percentage of sequence identity is used in reference to proteins or peptides, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art. The scoring of conservative substitutions can be calculated according to, e.g., the algorithm of Meyers & Miller, Computer Applic. Biol. Sci. 4:11-17 (1988), e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif.).

The terms “isolated”, “purified”, or “biologically pure” refer to material that is substantially or essentially free from components which normally accompany it as found in its native state. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. In an isolated target gene, the nucleic acid of interest is separated from open reading frames which flank the target gene and encode proteins other than the target protein. The term “purified” denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and most preferably at least 99% pure.

A “label” is a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include fluorescent proteins such as green, yellow, red or blue fluorescent proteins, radioisotopes such as ³²P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins for which antisera or monoclonal antibodies are available (e.g., the polypeptide of SEQ ID NO:2 can be made detectable, e.g., by incorporating a radio-label into the peptide, and used to detect antibodies specifically reactive with the peptide).

“Moderately stringent conditions” may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and % SDS) less stringent than those described above. An example of moderately stringent conditions is overnight incubation at 37° C. in a solution comprising: 20% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 μg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1×SSC at about 37-50° C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.

“Modulators,” “inhibitors,” and “activators of a target protein” refer to modulatory molecules identified using in vitro and in vivo assays for target protein activity. Such assays include ATPase activity, microtubule gliding, microtubule depolymerizing activity, and binding activity such as microtubule binding activity or binding of nucleotide analogs. Samples or assays are treated with a candidate agent at a test concentration and at a control concentration. The control concentration can be zero. If there is a change in target protein activity between the two concentrations, this change indicates the identification of a modulator. A change in activity, which can be an increase or decrease, is preferably a change of at least 20% to 50%, more preferably by at least 50% to 75%, more preferably at least 75% to 100%, and more preferably 150% to 200%, and most preferably is a change of at least 2 to 10 fold compared to a control. Additionally, a change can be indicated by a change in binding specificity or substrate.
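The comparison between test and control concentrations can be sketched as a simple percent-change calculation. The function names are hypothetical, the activity values are assumed to be any consistent readout (e.g., an absorbance or fluorescence rate), and the default 20% threshold is the lowest of the preferred changes recited above.

```python
def percent_change(control_activity, test_activity):
    """Signed percent change in activity relative to the control."""
    if control_activity == 0:
        raise ValueError("control activity must be nonzero")
    return 100.0 * (test_activity - control_activity) / control_activity


def is_candidate_modulator(control_activity, test_activity, min_percent=20.0):
    """True if activity rises or falls by at least min_percent versus control."""
    change = percent_change(control_activity, test_activity)
    return abs(change) >= min_percent
```

A negative value from `percent_change` suggests an inhibitor and a positive value an activator; in practice replicate wells and an estimate of assay noise would be needed before any such call is made.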

“Molecular motor” or “motor protein” refers to a molecule that utilizes chemical energy to generate mechanical force. According to one embodiment, the molecular motor drives the motile properties of the cytoskeleton.

The phrase “motor domain” refers to the domain of a target protein that confers membership in the kinesin superfamily of motor proteins through a sequence identity of approximately 35-45% to the motor domain of true kinesin.

The term “nucleic acid” refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences as well as the sequence explicitly indicated. For example, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); Cassol et al. 1992; Rossolini et al. Mol. Cell. Probes 8:91-98 (1994)). The term nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by a gene.

“Nucleic acid probe or oligonucleotide” is defined as a nucleic acid capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases. In addition, the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, for example, probes may be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be understood by one of skill in the art that probes may bind target sequences lacking complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. The probes are preferably directly labeled with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin complex may later bind. By assaying for the presence or absence of the probe, one can detect the presence or absence of the select sequence or subsequence.

The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residues is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. A target protein comprises a polypeptide demonstrated to have at least microtubule stimulated ATPase activity. Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.

A “promoter” is defined as an array of nucleic acid control sequences that direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of a polymerase II type promoter, a TATA box element. A promoter also optionally includes distal enhancer or repressor elements which can be located as much as several thousand base pairs from the start site of transcription. A “constitutive” promoter is a promoter that is active under most environmental and developmental conditions. An “inducible” promoter is a promoter that is under environmental or developmental regulation. The term “operably linked” refers to a functional linkage between a nucleic acid expression control sequence (such as a promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence directs transcription of the nucleic acid corresponding to the second sequence.

The phrase “specifically (or selectively) binds” to an antibody or “specifically (or selectively) immunoreactive with,” when referring to a protein or peptide, refers to a binding reaction that is determinative of the presence of the protein in a heterogeneous population of proteins and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein at least two times the background and do not substantially bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. For example, antibodies raised to the target protein with the amino acid sequence encoded in SEQ ID NO:2 can be selected to obtain only those antibodies that are specifically immunoreactive with the target protein and not with other proteins, except for polymorphic variants, orthologs, alleles, and closely related homologues of HSET. This selection may be achieved by subtracting out antibodies that cross react with molecules, for example, such as C. elegans unc-104 and human Kif1A. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays are routinely used to select antibodies specifically immunoreactive with a protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory Manual (1988), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity). Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.

The phrase “selectively associates with” refers to the ability of a nucleic acid to “selectively hybridize” with another as defined above, or the ability of an antibody to “selectively (or specifically) bind” to a protein, as defined above.

“Test composition” (used interchangeably herein with “candidate agent” and “test compound” and “test agent”) refers to a molecule or composition whose effect on the interaction between one or more cytoskeletal components it is desired to assay. The “test composition” can be any molecule or mixture of molecules, optionally in a carrier.

A “therapeutic” as used herein refers to a compound which is believed to be capable of modulating the cytoskeletal system in vivo and which can have application in both human and animal disease. Modulation of the cytoskeletal system would be desirable in a number of conditions including, but not limited to: abnormal stimulation of endothelial cells (e.g., atherosclerosis), solid and hematopoietic tumors and tumor metastasis, benign tumors, for example, hemangiomas, acoustic neuromas, neurofibromas, pyogenic granulomas, vascular malfunctions, abnormal wound healing, inflammatory and immune disorders such as rheumatoid arthritis, Behçet's disease, gout or gouty arthritis, abnormal angiogenesis accompanying rheumatoid arthritis, psoriasis, diabetic retinopathy, and other ocular angiogenic diseases such as macular degeneration, corneal graft rejection, corneal overgrowth, glaucoma, and Osler-Weber syndrome.

II. The Target Protein

According to the present invention, a target protein is a molecule that either directly or indirectly produces ADP or phosphate and that comprises a motor domain. In a preferred embodiment, the target protein is an enzyme having activity which produces ADP and/or phosphate as a reaction product. Also included within the definition of the target proteins are amino acid sequence variants of wild-type target proteins.

Target proteins of the present invention may also be modified to form chimeric molecules comprising a fusion of a target protein with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino or carboxyl terminus of the target protein. Provision of the epitope tag enables the target protein to be readily detected, as well as readily purified by affinity purification. Various tag epitopes are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 (see, Field et al. (1988) Mol. Cell. Biol. 8:2159); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto (see, Evans et al., (1985) Molecular and Cellular Biology, 5:3610); and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (see, Paborsky et al., (1990) Protein Engineering, 3:547). Other tag polypeptides include the Flag-peptide (see, Hopp et al. (1988) BioTechnology 6:1204); the KT3 epitope peptide (see, Martine et al. (1992) Science, 255:192); tubulin epitope peptide (see, Skinner (1991) J. Biol. Chem. 266:15173); and the T7 gene 10 protein peptide tag (see, Lutz-Freyermuth et al. (1990) Proc. Natl. Acad. Sci. USA 87:6393). Target proteins of the present invention are meant to include both the untagged target protein as well as the chimeric protein wherein the target protein has been fused to one or more tag epitopes.

In a particularly preferred embodiment, the target protein comprises HSET or a fragment thereof.

In another aspect of this invention, the target protein comprises an amino acid sequence which has greater than 70% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, preferably greater than 80%, more preferably greater than 90%, more preferably greater than 95% or, in another embodiment, has 98 to 100% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6.
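The identity tiers recited above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (gaps written as "-") and counts identity only over ungapped columns; this is a simplification for illustration and is not the comparison algorithm defined elsewhere in this specification. The function names `percent_identity` and `identity_tier` and the variant sequence are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns (an assumption of this sketch)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def identity_tier(pct: float) -> str:
    """Map a percent identity onto the tiers recited in the text."""
    if pct >= 98.0:
        return "98 to 100%"
    for cutoff in (95, 90, 80, 70):
        if pct > cutoff:
            return f"greater than {cutoff}%"
    return "70% or below"


# First ten residues of SEQ ID NO:2 versus a hypothetical single-residue variant.
reference = "MQELKGNIRV"
variant = "MQELKGNLRV"
pct = percent_identity(reference, variant)
print(pct, identity_tier(pct))
```

Because the tiers are stated as "greater than" a cutoff, a value exactly equal to a cutoff falls into the tier below it in this sketch.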

In a particularly preferred embodiment, a fragment of the HSET protein comprising a portion of its hydrolytically active “motor” domain is used. This motor domain has been cloned and expressed in bacteria such that large quantities of biochemically active, substantially pure protein are available. Preferably, the target protein comprises an amino acid sequence which has greater than 70% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, preferably greater than 80%, more preferably greater than 90%, more preferably greater than 95% or, in another embodiment, has 98 to 100% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6.

A particularly preferred embodiment is drawn to a fragment of the HSET protein SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. More preferably, this fragment is tagged at the C-terminus with a myc epitope and 6 histidines. More preferably, this fragment is tagged at the N-terminus with a T7 epitope and at the C-terminus with a myc epitope and 6 histidines.

In one aspect, the nucleic acids provided herein are defined by the proteins encoded thereby. A preferred embodiment of the invention is drawn to an isolated nucleic acid sequence encoding a microtubule motor protein, wherein the motor protein has the following properties: (i) the protein's activity includes microtubule stimulated ATPase activity; and (ii) the protein has a sequence that has greater than 70% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, preferably greater than 80%, more preferably greater than 90%, more preferably greater than 95% or, in another embodiment, has 98 to 100% sequence identity with SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. In one embodiment, the nucleic acid encodes HSET or a fragment thereof. In another embodiment, the nucleic acid encodes SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6.

In one embodiment, the nucleic acid comprises a sequence which has one or more of the following characteristics: greater than 55 or 60% sequence identity with SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5, preferably greater than 70%, more preferably greater than 80%, more preferably greater than 90 or 95% or, in another embodiment, has 98 to 100% sequence identity with SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5. In another embodiment provided herein, the nucleic acid hybridizes under stringent conditions to a nucleic acid having a sequence or complementary sequence of SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5. In another embodiment, the nucleic acid has a nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5. As described above, when describing the nucleotide in terms of SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5, the sequence identity may be slightly lower due to the degeneracy in the genetic code.

As will be appreciated by those in the art, the target proteins can be made in a variety of ways, including both synthesis de novo and by expressing a nucleic acid encoding the protein.

Numerous suitable methods for recombinant protein expression, including generation of expression vectors, generation of fusion proteins, introduction of expression vectors into host cells, protein expression in host cells, and purification methods, are known to those in the art.

In a preferred embodiment, the target proteins are purified for use in the assays to provide substantially pure samples. Alternatively, the target protein need not be substantially pure as long as the sample comprising the target protein is substantially free of other components that can contribute to the production of ADP or phosphate.

The target proteins may be isolated or purified in a variety of ways known to those skilled in the art depending on what other components are present in the sample. Standard purification methods include electrophoretic, molecular, immunological, and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and chromatofocussing. For example, the target protein can be purified using a standard anti-target antibody column. Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, are also useful.

Either naturally occurring or recombinant target protein can be purified for use in functional assays. The target protein may be purified to substantial purity by standard techniques, including selective precipitation with such substances as ammonium sulfate; column chromatography, immunopurification methods, and others (see, e.g., Scopes, Protein Purification: Principles and Practice (1982); U.S. Pat. No. 4,673,641; Ausubel et al. supra; and Sambrook et al., supra). A preferred method of purification is use of Ni-NTA agarose (Qiagen).

Suitable purification schemes for some specific kinesins are outlined in U.S. Ser. No. 09/295,612, filed Apr. 20, 1999, hereby expressly incorporated herein in its entirety for all purposes.

The expressed protein can be purified by standard chromatographic procedures to yield a purified, biochemically active protein. The activity of any of the peptides provided herein can be routinely confirmed by the assays provided herein such as those which assay ATPase activity or microtubule binding activity. Biologically active target protein is useful for identifying modulators of target protein or fragments thereof and kinesin superfamily members using in vitro assays such as microtubule gliding assays, ATPase assays (Kodama et al., J. Biochem. 99:1465-1472 (1986); Stewart et al., Proc. Nat'l Acad. Sci. USA 90:5209-5213 (1993)), and binding assays including microtubule binding assays (Vale et al., Cell 42:39-50 (1985)), as described in detail below.

III. Assays for Modulators of the Target Protein

A. Functional Assays

Assays that can be used to test for modulators of the target protein include a variety of in vitro or in vivo assays, e.g., microtubule gliding assays, binding assays such as microtubule binding assays, microtubule depolymerization assays, and ATPase assays (Kodama et al., J. Biochem. 99:1465-1472 (1986); Stewart et al., Proc. Nat'l Acad. Sci. USA 90:5209-5213 (1993); Lombillo et al., J. Cell Biol. 128:107-115 (1995); Vale et al., Cell 42:39-50 (1985)).

Modulation is tested by screening for candidate agents capable of modulating the activity of the target protein comprising the steps of combining a candidate agent with the target protein, as above, and determining an alteration in the biological activity of the target protein. Thus, in this embodiment, the candidate agent should both bind to the target protein (although this may not be necessary), and alter its biological or biochemical activity as defined herein. The methods include both in vitro screening methods and in vivo screening of cells for alterations in cell cycle distribution, cell viability, or for the presence, morphology, activity, distribution, or amount of mitotic spindles, as are generally outlined above.

In a preferred embodiment, molecular motor activity is measured by the methods disclosed in Ser. No. 09/314,464, filed May 18, 1999, entitled “Compositions and assay utilizing ADP or phosphate for detecting protein modulators”, which is incorporated herein by reference in its entirety. More specifically, this assay detects modulators of any aspect of a kinesin motor function ranging from interaction with microtubules to hydrolysis of ATP. ADP or phosphate is used as the readout for protein activity.

There are a number of enzymatic assays known in the art which use ADP as a substrate. For example, kinase reactions such as that catalyzed by pyruvate kinase are known. See, Nature 78:632 (1956) and Mol. Pharmacol. 6:31 (1970). This is a preferred method in that it allows the regeneration of ATP. In one embodiment, the level of activity of the enzymatic reaction is determined directly. In a preferred embodiment, the level of activity of the enzymatic reaction which uses ADP as a substrate is measured indirectly by being coupled to another reaction. For example, in one embodiment, the method further comprises a lactate dehydrogenase reaction under conditions which normally allow the oxidation of NADH, wherein said lactate dehydrogenase reaction is dependent on the pyruvate kinase reaction. Measurement of enzymatic reactions by coupling is known in the art. Furthermore, there are a number of reactions which utilize phosphate. Examples of such reactions include a purine nucleoside phosphorylase reaction. This reaction can be measured directly or indirectly. A particularly preferred embodiment utilizes the pyruvate kinase/lactate dehydrogenase system.

In one embodiment, the detection of the ADP or phosphate proceeds non-enzymatically, for example, by binding or reacting the ADP or phosphate with a detectable compound. For example, phosphomolybdate based assays may be used which involve conversion of free phosphate to a phosphomolybdate complex. One method of quantifying the phosphomolybdate is with malachite green. Alternatively, a fluorescently labeled form of a phosphate binding protein, such as the E. coli phosphate binding protein, can be used to measure phosphate by a shift in its fluorescence.

In addition, target protein activity can be examined by determining modulation of target protein in vitro using cultured cells. The cells are treated with a candidate agent and the effect of such agent on the cells is then determined either directly or by examining relevant surrogate markers. For example, characteristics such as mitotic spindle morphology and cell cycle distribution can be used to determine the effect.

Thus, in a preferred embodiment, the methods comprise combining a target protein and a candidate agent, and determining the effect of the candidate agent on the target protein. Generally a plurality of assay mixtures are run in parallel with different agent concentrations to obtain a differential response to the various concentrations. Typically, one of these concentrations serves as a negative control, i.e., at zero concentration or below the level of detection.
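The use of a zero-concentration negative control can be sketched as a simple normalization. The following is an illustration only: the function name `percent_of_control` and the example rates are hypothetical, and the sketch assumes each assay mixture yields a single activity rate in arbitrary but consistent units.

```python
from typing import Dict


def percent_of_control(rates_by_conc: Dict[float, float]) -> Dict[float, float]:
    """Express each measured rate as a percentage of the zero-concentration control."""
    if 0.0 not in rates_by_conc:
        raise ValueError("a zero-concentration (negative control) rate is required")
    control = rates_by_conc[0.0]
    if control <= 0:
        raise ValueError("control rate must be positive")
    return {conc: 100.0 * rate / control
            for conc, rate in sorted(rates_by_conc.items())}


# Hypothetical rates at three agent concentrations; 0.0 is the negative control.
example_rates = {0.0: 2.0, 1.0: 1.5, 10.0: 0.5}
print(percent_of_control(example_rates))
```

Values below 100 in such a table would suggest inhibition at that concentration, and values above 100 would suggest activation.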

As will be appreciated by those in the art, the components may be added in buffers and reagents to assay target protein activity and give optimal signals. Since the methods allow kinetic measurements, the incubation periods can be optimized to give adequate detection signals over the background.

In a preferred embodiment, an antifoam or a surfactant is included in the assay mixture. Suitable antifoams include, but are not limited to, antifoam 289 (Sigma). Suitable surfactants include, but are not limited to, Tween, Tritons, including Triton X-100, saponins, and polyoxyethylene ethers. Generally, the antifoams, detergents, or surfactants are added at a range from about 0.01 ppm to about 10 ppm.

A preferred assay design is also provided. In one aspect, the invention provides a multi-time-point (kinetic) assay, with at least two data points being preferred. In the case of multiple measurements, the absolute rate of the protein activity can be determined.
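Determination of a rate from multiple time points can be sketched as an ordinary least-squares slope. This is an illustration under stated assumptions, not a description of the instrument software: the function name `rate_from_timecourse` and the example readings are hypothetical, and the signal is assumed to change linearly over the window supplied.

```python
from typing import Sequence


def rate_from_timecourse(times_s: Sequence[float], signal: Sequence[float]) -> float:
    """Least-squares slope of signal versus time (signal units per second)."""
    n = len(times_s)
    if n < 2 or n != len(signal):
        raise ValueError("need at least two matched time points")
    mean_t = sum(times_s) / n
    mean_y = sum(signal) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, signal))
    return sxy / sxx


# Hypothetical absorbance readings taken every 10 seconds.
times = [0.0, 10.0, 20.0, 30.0]
readings = [1.0, 0.9, 0.8, 0.7]
print(rate_from_timecourse(times, readings))
```

With exactly two data points the slope reduces to the simple difference quotient; additional points average out read-to-read noise.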

B. Binding Assays

In a preferred embodiment, the binding of the candidate agent is determined through the use of competitive binding assays. In this embodiment, the competitor is a binding moiety known to bind to the target protein, such as an antibody, peptide, binding partner, ligand, etc. Under certain circumstances, there may be competitive binding as between the candidate agent and the binding moiety, with the binding moiety displacing the candidate agent.

Competitive screening assays may be done by combining the target protein and a drug candidate in a first sample. A second sample comprises a candidate agent, the target protein and a compound that is known to modulate the target protein. This may be performed in either the presence or absence of microtubules. The binding of the candidate agent is determined for both samples, and a change, or difference in binding between the two samples indicates the presence of an agent capable of binding to the target protein and potentially modulating its activity. That is, if the binding of the candidate agent is different in the second sample relative to the first sample, the candidate agent is capable of binding to the target protein.

In one embodiment, the candidate agent is labeled. Either the candidate agent, or the competitor, or both, is added first to the target protein for a time sufficient to allow binding. Incubations may be performed at any temperature which facilitates optimal activity, typically between 4 and 40° C. Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid high throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed away. The second component is then added, and the presence or absence of the labeled component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the candidate agent. Displacement of the competitor is an indication the candidate agent is binding to the target protein and thus is capable of binding to, and potentially modulating, the activity of the target protein. In this embodiment, either component can be labeled. Thus, for example, if the competitor is labeled, the presence of label in the wash solution indicates displacement by the agent. Alternatively, if the candidate agent is labeled, the presence of the label on the support indicates displacement.

In an alternative embodiment, the candidate agent is added first, with incubation and washing, followed by the competitor. The absence of binding by the competitor may indicate the candidate agent is bound to the target protein with a higher affinity. Thus, if the candidate agent is labeled, the presence of the label on the support, coupled with a lack of competitor binding, may indicate the candidate agent is capable of binding to the target protein.

C. Candidate Agents

Candidate agents encompass numerous chemical classes, though typically they are organic molecules, preferably small organic compounds having a molecular weight of more than 100 and less than about 2,500 daltons. Candidate agents comprise functional groups necessary for structural interaction with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more of the above functional groups. Candidate agents are also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred are peptides.

Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural compounds. In a preferred embodiment, the candidate agents are organic chemical moieties, a wide variety of which are available in the literature.

D. Other Assay Components

The assays provided utilize target protein as defined herein. In one embodiment, portions of target protein are utilized; in a preferred embodiment, portions having target protein activity as described herein are used. In addition, the assays described herein may utilize either isolated target proteins or cells or animal models comprising the target proteins.

A variety of other reagents may be included in the screening assays. These include reagents such as salts, neutral proteins (e.g., albumin), detergents, etc., which may be used to facilitate optimal protein-protein binding and/or reduce non-specific or background interactions. Also, reagents that otherwise improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture of components may be added in any order that provides for the requisite binding.

IV. Applications

The methods of the invention are used to identify compounds useful in the treatment of cellular proliferation diseases. Disease states which can be treated by the methods and compositions provided herein include, but are not limited to, cancer (further discussed below), autoimmune disease, arthritis, graft rejection, inflammatory bowel disease, and proliferation induced after medical procedures, including, but not limited to, surgery, angioplasty, and the like. It is appreciated that in some cases the cells may not be in a hyper or hypo proliferation state (abnormal state) and still require treatment. For example, during wound healing, the cells may be proliferating “normally”, but proliferation enhancement may be desired. Similarly, as discussed above, in the agriculture arena, cells may be in a “normal” state, but proliferation modulation may be desired to enhance a crop by directly enhancing growth of a crop, or by inhibiting the growth of a plant or organism which adversely affects the crop. Thus, in one embodiment, the invention herein includes application to cells or individuals afflicted with, or facing impending affliction with, any one of these disorders or states.

The compositions and methods provided herein are particularly deemed useful for the treatment of cancer including solid tumors such as skin, breast, brain, cervical carcinomas, testicular carcinomas, etc. More particularly, cancers that may be treated by the compositions and methods of the invention include, but are not limited to: Cardiac: sarcoma (angiosarcoma, fibrosarcoma, rhabdomyosarcoma, liposarcoma), myxoma, rhabdomyoma, fibroma, lipoma and teratoma; Lung: bronchogenic carcinoma (squamous cell, undifferentiated small cell, undifferentiated large cell, adenocarcinoma), alveolar (bronchiolar) carcinoma, bronchial adenoma, sarcoma, lymphoma, chondromatous hamartoma, mesothelioma; Gastrointestinal: esophagus (squamous cell carcinoma, adenocarcinoma, leiomyosarcoma, lymphoma), stomach (carcinoma, lymphoma, leiomyosarcoma), pancreas (ductal adenocarcinoma, insulinoma, glucagonoma, gastrinoma, carcinoid tumors, vipoma), small bowel (adenocarcinoma, lymphoma, carcinoid tumors, Kaposi's sarcoma, leiomyoma, hemangioma, lipoma, neurofibroma, fibroma), large bowel (adenocarcinoma, tubular adenoma, villous adenoma, hamartoma, leiomyoma); Genitourinary tract: kidney (adenocarcinoma, Wilms' tumor [nephroblastoma], lymphoma, leukemia), bladder and urethra (squamous cell carcinoma, transitional cell carcinoma, adenocarcinoma), prostate (adenocarcinoma, sarcoma), testis (seminoma, teratoma, embryonal carcinoma, teratocarcinoma, choriocarcinoma, sarcoma, interstitial cell carcinoma, fibroma, fibroadenoma, adenomatoid tumors, lipoma); Liver: hepatoma (hepatocellular carcinoma), cholangiocarcinoma, hepatoblastoma, angiosarcoma, hepatocellular adenoma, hemangioma; Bone: osteogenic sarcoma (osteosarcoma), fibrosarcoma, malignant fibrous histiocytoma, chondrosarcoma, Ewing's sarcoma, malignant lymphoma (reticulum cell sarcoma), multiple myeloma, malignant giant cell tumor, chordoma, osteochondroma (osteocartilaginous exostoses), benign chondroma, chondroblastoma, chondromyxofibroma, osteoid osteoma and giant cell tumors; Nervous system: skull (osteoma, hemangioma, granuloma, xanthoma, osteitis deformans), meninges (meningioma, meningiosarcoma, gliomatosis), brain (astrocytoma, medulloblastoma, glioma, ependymoma, germinoma [pinealoma], glioblastoma multiforme, oligodendroglioma, schwannoma, retinoblastoma, congenital tumors), spinal cord (neurofibroma, meningioma, glioma, sarcoma); Gynecological: uterus (endometrial carcinoma), cervix (cervical carcinoma, pre-tumor cervical dysplasia), ovaries (ovarian carcinoma [serous cystadenocarcinoma, mucinous cystadenocarcinoma, unclassified carcinoma], granulosa-thecal cell tumors, Sertoli-Leydig cell tumors, dysgerminoma, malignant teratoma), vulva (squamous cell carcinoma, intraepithelial carcinoma, adenocarcinoma, fibrosarcoma, melanoma), vagina (clear cell carcinoma, squamous cell carcinoma, botryoid sarcoma [embryonal rhabdomyosarcoma]), fallopian tubes (carcinoma); Hematologic: blood (myeloid leukemia [acute and chronic], acute lymphoblastic leukemia, chronic lymphocytic leukemia, myeloproliferative diseases, multiple myeloma, myelodysplastic syndrome), Hodgkin's disease, non-Hodgkin's lymphoma [malignant lymphoma]; Skin: malignant melanoma, basal cell carcinoma, squamous cell carcinoma, Kaposi's sarcoma, moles, dysplastic nevi, lipoma, angioma, dermatofibroma, keloids, psoriasis; and Adrenal glands: neuroblastoma. Thus, the term “cancerous cell” as provided herein, includes a cell afflicted by any one of the above identified conditions.

Accordingly, the compositions of the invention are administered to cells. By “administered” herein is meant administration of a therapeutically effective dose of the candidate agents of the invention to a cell either in cell culture or in a patient. By “therapeutically effective dose” herein is meant a dose that produces the effects for which it is administered. The exact dose will depend on the purpose of the treatment, and will be ascertainable by one skilled in the art using known techniques. As is known in the art, adjustments for systemic versus localized delivery, age, body weight, general health, sex, diet, time of administration, drug interaction and the severity of the condition may be necessary, and will be ascertainable with routine experimentation by those skilled in the art. By “cells” herein is meant almost any cell in which mitosis or meiosis can be altered.

A “patient” for the purposes of the present invention includes both humans and other animals, particularly mammals, and other organisms. Thus the methods are applicable to both human therapy and veterinary applications. In the preferred embodiment the patient is a mammal, and in the most preferred embodiment the patient is human.

Candidate agents having the desired pharmacological activity may be administered in a physiologically acceptable carrier to a patient, as described herein. Depending upon the manner of introduction, the compounds may be formulated in a variety of ways as discussed below. The concentration of therapeutically active compound in the formulation may vary from about 0.1-100 wt. %. The agents may be administered alone or in combination with other treatments, e.g., radiation or other chemotherapeutic agents.

In a preferred embodiment, the pharmaceutical compositions are in a water soluble form, such as pharmaceutically acceptable salts, which is meant to include both acid and base addition salts.

The pharmaceutical compositions can be prepared in various forms, such as granules, tablets, pills, suppositories, capsules, suspensions, salves, lotions and the like. Pharmaceutical grade organic or inorganic carriers and/or diluents suitable for oral and topical use can be used to make up compositions containing the therapeutically-active compounds. Diluents known to the art include aqueous media, vegetable and animal oils and fats. Stabilizing agents, wetting and emulsifying agents, salts for varying the osmotic pressure or buffers for securing an adequate pH value, and skin penetration enhancers can be used as auxiliary agents. The pharmaceutical compositions may also include one or more of the following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; coloring agents; and polyethylene glycol. Additives are well known in the art, and are used in a variety of formulations.

The administration of the candidate agents of the present invention can be done in a variety of ways as discussed above, including, but not limited to, orally, subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In some instances, for example, in the treatment of wounds and inflammation, the candidate agents may be directly applied as a solution or spray.

One of skill in the art will readily appreciate that the methods described herein also can be used for diagnostic applications. A diagnostic as used herein is a compound or method that assists in the identification and characterization of a health or disease state in humans or other animals.

The present invention also provides for kits for screening for modulators of the target protein. Such kits can be prepared from readily available materials and reagents. For example, such kits can comprise any one or more of the following materials: biologically active target protein, reaction tubes, and instructions for testing activity of the target protein. Preferably, the kit contains biologically active target protein. A wide variety of kits and components can be prepared according to the present invention, depending upon the intended user of the kit and the particular needs of the user. For example, the kit can be tailored for ATPase assays, microtubule gliding assays, or microtubule binding assays.

V. Examples

This assay is based on detection of ADP production from a target protein's microtubule stimulated ATPase. ADP production is monitored by a coupled enzyme system consisting of pyruvate kinase and lactate dehydrogenase. Under the assay conditions described below, pyruvate kinase catalyzes the conversion of ADP and phosphoenol pyruvate to pyruvate and ATP. Lactate dehydrogenase then catalyzes the oxidation-reduction reaction of pyruvate and NADH to lactate and NAD+. Thus, for each molecule of ADP produced, one molecule of NADH is consumed. The amount of NADH in the assay solution is monitored by measuring light absorbance at a wavelength of 340 nm.
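The one-to-one relationship between NADH consumed and ADP produced allows an absorbance slope to be converted to an ADP production rate by the Beer-Lambert law. The following is a minimal sketch, not part of the disclosed method: it assumes the commonly cited molar extinction coefficient of NADH at 340 nm (6220 M⁻¹ cm⁻¹) and an optical path length supplied by the user, since the path length in a microplate well depends on fill volume and plate geometry. The function name `adp_rate_uM_per_s` is hypothetical.

```python
# Commonly cited molar extinction coefficient of NADH at 340 nm (M^-1 cm^-1).
NADH_EXTINCTION_M_CM = 6220.0


def adp_rate_uM_per_s(dA340_per_s: float, path_cm: float) -> float:
    """Convert an A340 slope into an ADP production rate in micromolar per second.

    NADH consumption lowers A340, so a negative slope gives a positive ADP rate.
    One NADH is consumed per ADP produced in the coupled system.
    """
    if path_cm <= 0:
        raise ValueError("path length must be positive")
    nadh_molar_per_s = -dA340_per_s / (NADH_EXTINCTION_M_CM * path_cm)
    return nadh_molar_per_s * 1e6


# Hypothetical slope of -0.000622 absorbance units per second over a 1 cm path.
print(adp_rate_uM_per_s(-0.000622, 1.0))
```

The path length is the main source of uncertainty in such a conversion; for comparing wells on the same plate the raw slope may be sufficient.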

The final 25 μl assay solution consists of the following: 5 μg/ml target protein, 30 μg/ml microtubules, 5 μM Taxol, 0.8 mM NADH, 1.5 mM phosphoenol pyruvate, 3.5 U/ml pyruvate kinase, 5 U/ml lactate dehydrogenase, 25 mM Pipes/KOH pH 6.8, 2 mM MgCl₂, 1 mM EGTA, 1 mM DTT, 0.1 mg/ml BSA, 0.001% antifoam 289, and 1 mM ATP.

Potential candidate agents are dissolved in DMSO at a concentration of about 1 mg/ml and 0.5 μl of each chemical solution is dispensed into a single well of a clear 384 well plate. Each of the 384 wells is then filled with 20 μl of a solution consisting of all of the assay components described above except for ATP. The plate is agitated at a high frequency. To start the assay, 5 μl of a solution containing ATP is added to each well. The plate is agitated and the absorbance is read at 340 nm over various time intervals. The assay is run at room temperature.
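The dilution arithmetic implied by the dispensing steps above can be sketched as follows. This is an illustration only: it assumes the three dispensed volumes are simply additive, which gives 25.5 μl per well rather than the nominal 25 μl, and the helper name `final_concentration` is hypothetical.

```python
def final_concentration(stock_conc: float, added_ul: float, total_ul: float) -> float:
    """Concentration after diluting added_ul of stock into total_ul (same units as stock)."""
    return stock_conc * added_ul / total_ul


# Volumes dispensed per well, taken from the text.
compound_ul = 0.5   # candidate agent in DMSO
mix_ul = 20.0       # assay components without ATP
atp_ul = 5.0        # ATP start solution
total_ul = compound_ul + mix_ul + atp_ul  # assumes additive volumes

# Candidate agent: about 1 mg/ml stock, expressed here as 1000 ug/ml.
compound_ug_per_ml = final_concentration(1000.0, compound_ul, total_ul)

# DMSO carried into the well, as percent by volume.
dmso_percent = 100.0 * compound_ul / total_ul

# ATP stock needed in the 5 ul addition to reach 1 mM in the well.
atp_stock_mM = 1.0 * total_ul / atp_ul

print(round(total_ul, 1), round(compound_ug_per_ml, 1),
      round(dmso_percent, 2), round(atp_stock_mM, 2))
```

Under these assumptions each candidate agent is present at roughly 20 μg/ml with about 2% DMSO, and the ATP start solution would be on the order of 5 mM.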

The assay components and the performance of the assay are optimized together to match the overall read time with the rate of the target protein's ADP production. The read time should be long enough for the rate of NADH consumption to reach steady state beyond an initial lag time of several seconds.

It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety.

6 1 1085 DNA Human 1 atgaggaact caagggcaac atccgtgtat tctgccgggt ccgccctgtc ctgccggggg 60 agcccactcc accccctggc ctcctcctgt ttccctctgg ccctggtggg ccctctgatc 120 ctccaacccg ccttagcctc tcccggtctg acgagcggcg tgggaccctg agtggggcac 180 cagctccccc aactcgccat gatttttcct ttgaccgggt attcccacca ggaagtggac 240 aggatgaagt gtttgaagag attgccatgc ttgtccagtc agccctggat ggctatccag 300 tatgcatctt tgcctatggc cagacaggca gtggcaagac cttcacaatg gagggtgggc 360 ctgggggaga cccccagttg gaggggctga tccctcgggc cctgcggcac ctcttctctg 420 tggctcagga gctgagtggt cagggctgga cctacagctt tgtagcaagc tacgtagaga 480 tctacaatga gactgtccgg gacctgctgg ccactggaac ccggaagggt caagggggcg 540 agtgtgagat tcgccgtgca gggccaggga gtgaggagct cactgtcacc aatgctcgat 600 atgtccctgt ctcctgtgag aaagaagtgg acgccctgct tcatctggcc cgccagaatc 660 gggctgtggc ccgcacagcc cagaatgaac ggtcatcacg cagccacagt gtattccagc 720 tacagatttc tggggagcac tccagccgag gcctgcagtg tggggccccc ctcagtcttg 780 tggacctggc cgggagtgag cgacttgacc ccggcttagc cctcggcccc ggggagcggg 840 aacgccttcg ggaaacacag gccattaaca gcagcctgtc cacgctgggg ctggttatca 900 tggccctgag caacaaggag tcccacgtgc cttaccggaa cagcaaactg acctacctgc 960 tgcagaactc tctgggtggt agtgctaaga tgctcatgtt tgtgaacatt tctccactgg 1020 aagagaacgt ctccgagtcc ctcaactctc tacgctttgc ctccaaggtg aaccagtgtg 1080 tttga 1085 2 361 PRT Human 2 Met Gln Glu Leu Lys Gly Asn Ile Arg Val Phe Cys Arg Val Arg Pro 1 5 10 15 Val Leu Pro Gly Glu Pro Thr Pro Pro Pro Gly Leu Leu Leu Phe Pro 20 25 30 Ser Gly Pro Gly Gly Pro Ser Asp Pro Pro Thr Arg Leu Ser Leu Ser 35 40 45 Arg Ser Asp Glu Arg Arg Gly Thr Leu Ser Gly Ala Pro Ala Pro Pro 50 55 60 Thr Arg His Asp Phe Ser Phe Asp Arg Val Phe Pro Pro Gly Ser Gly 65 70 75 80 Gln Asp Glu Val Phe Glu Glu Ile Ala Met Leu Val Gln Ser Ala Leu 85 90 95 Asp Gly Tyr Pro Val Cys Ile Phe Ala Tyr Gly Gln Thr Gly Ser Gly 100 105 110 Lys Thr Phe Thr Met Glu Gly Gly Pro Gly Gly Asp Pro Gln Leu Glu 115 120 125 Gly Leu Ile Pro Arg Ala Leu Arg His Leu Phe Ser Val Ala Gln Glu 130 135 140 Leu Ser Gly Gln 
Gly Trp Thr Tyr Ser Phe Val Ala Ser Tyr Val Glu 145 150 155 160 Ile Tyr Asn Glu Thr Val Arg Asp Leu Leu Ala Thr Gly Thr Arg Lys 165 170 175 Gly Gln Gly Gly Glu Cys Glu Ile Arg Arg Ala Gly Pro Gly Ser Glu 180 185 190 Glu Leu Thr Val Thr Asn Ala Arg Tyr Val Pro Val Ser Cys Glu Lys 195 200 205 Glu Val Asp Ala Leu Leu His Leu Ala Arg Gln Asn Arg Ala Val Ala 210 215 220 Arg Thr Ala Gln Asn Glu Arg Ser Ser Arg Ser His Ser Val Phe Gln 225 230 235 240 Leu Gln Ile Ser Gly Glu His Ser Ser Arg Gly Leu Gln Cys Gly Ala 245 250 255 Pro Leu Ser Leu Val Asp Leu Ala Gly Ser Glu Arg Leu Asp Pro Gly 260 265 270 Leu Ala Leu Gly Pro Gly Glu Arg Glu Arg Leu Arg Glu Thr Gln Ala 275 280 285 Ile Asn Ser Ser Leu Ser Thr Leu Gly Leu Val Ile Met Ala Leu Ser 290 295 300 Asn Lys Glu Ser His Val Pro Tyr Arg Asn Ser Lys Leu Thr Tyr Leu 305 310 315 320 Leu Gln Asn Ser Leu Gly Gly Ser Ala Lys Met Leu Met Phe Val Asn 325 330 335 Ile Ser Pro Leu Glu Glu Asn Val Ser Glu Ser Leu Asn Ser Leu Arg 340 345 350 Phe Ala Ser Lys Val Asn Gln Cys Val 355 360 3 1113 DNA Human 3 atgcaggaac tcaagggcaa catccgtgta ttctgccggg tccgccctgt cctgccgggg 60 gagcccactc caccccctgg cctcctcctg tttccctctg gccctggtgg gccctctgat 120 cctccaaccc gccttagcct ctcccggtct gacgagcggc gtgggaccct gagtggggca 180 ccagctcccc caactcgcca tgatttttcc tttgaccggg tattcccacc aggaagtgga 240 caggatgaag tgtttgaaga gattgccatg cttgtccagt cagccctgga tggctatcca 300 gtatgcatct ttgcctatgg ccagacaggc agtggcaaga ccttcacaat ggagggtggg 360 cctgggggag acccccagtt ggaggggctg atccctcggg ccctgcggca cctcttctct 420 gtggctcagg agctgagtgg tcagggctgg acctacagct ttgtagcaag ctacgtagag 480 atctacaatg agactgtccg ggacctgctg gccactggaa cccggaaggg tcaagggggc 540 gagtgtgaga ttcgccgtgc agggccaggg agtgaggagc tcactgtcac caatgctcga 600 tatgtccctg tctcctgtga gaaagaagtg gacgccctgc ttcatctggc ccgccagaat 660 cgggctgtgg cccgcacagc ccagaatgaa cggtcatcac gcagccacag tgtattccag 720 ctacagattt ctggggagca ctccagccga ggcctgcagt gtggggcccc cctcagtctt 780 gtggacctgg ccgggagtga gcgacttgac cccggcttag 
ccctcggccc cggggagcgg 840 gaacgccttc gggaaacaca ggccattaac agcagcctgt ccacgctggg gctggttatc 900 atggccctga gcaacaagga gtcccacgtg ccttaccgga acagcaaact gacctacctg 960 ctgcagaact ctctgggtgg tagtgctaag atgctcatgt ttgtgaacat ttctccactg 1020 gaagagaacg tctccgagtc cctcaactct ctacgctttg cctccaaggt gaaccagtgt 1080 gttattggta ctgctcaggc caacaggaag tga 1113 4 370 PRT Human 4 Met Gln Glu Leu Lys Gly Asn Ile Arg Val Phe Cys Arg Val Arg Pro 1 5 10 15 Val Leu Pro Gly Glu Pro Thr Pro Pro Pro Gly Leu Leu Leu Phe Pro 20 25 30 Ser Gly Pro Gly Gly Pro Ser Asp Pro Pro Thr Arg Leu Ser Leu Ser 35 40 45 Arg Ser Asp Glu Arg Arg Gly Thr Leu Ser Gly Ala Pro Ala Pro Pro 50 55 60 Thr Arg His Asp Phe Ser Phe Asp Arg Val Phe Pro Pro Gly Ser Gly 65 70 75 80 Gln Asp Glu Val Phe Glu Glu Ile Ala Met Leu Val Gln Ser Ala Leu 85 90 95 Asp Gly Tyr Pro Val Cys Ile Phe Ala Tyr Gly Gln Thr Gly Ser Gly 100 105 110 Lys Thr Phe Thr Met Glu Gly Gly Pro Gly Gly Asp Pro Gln Leu Glu 115 120 125 Gly Leu Ile Pro Arg Ala Leu Arg His Leu Phe Ser Val Ala Gln Glu 130 135 140 Leu Ser Gly Gln Gly Trp Thr Tyr Ser Phe Val Ala Ser Tyr Val Glu 145 150 155 160 Ile Tyr Asn Glu Thr Val Arg Asp Leu Leu Ala Thr Gly Thr Arg Lys 165 170 175 Gly Gln Gly Gly Glu Cys Glu Ile Arg Arg Ala Gly Pro Gly Ser Glu 180 185 190 Glu Leu Thr Val Thr Asn Ala Arg Tyr Val Pro Val Ser Cys Glu Lys 195 200 205 Glu Val Asp Ala Leu Leu His Leu Ala Arg Gln Asn Arg Ala Val Ala 210 215 220 Arg Thr Ala Gln Asn Glu Arg Ser Ser Arg Ser His Ser Val Phe Gln 225 230 235 240 Leu Gln Ile Ser Gly Glu His Ser Ser Arg Gly Leu Gln Cys Gly Ala 245 250 255 Pro Leu Ser Leu Val Asp Leu Ala Gly Ser Glu Arg Leu Asp Pro Gly 260 265 270 Leu Ala Leu Gly Pro Gly Glu Arg Glu Arg Leu Arg Glu Thr Gln Ala 275 280 285 Ile Asn Ser Ser Leu Ser Thr Leu Gly Leu Val Ile Met Ala Leu Ser 290 295 300 Asn Lys Glu Ser His Val Pro Tyr Arg Asn Ser Lys Leu Thr Tyr Leu 305 310 315 320 Leu Gln Asn Ser Leu Gly Gly Ser Ala Lys Met Leu Met Phe Val Asn 325 330 335 Ile Ser Pro Leu Glu Glu Asn Val Ser 
Glu Ser Leu Asn Ser Leu Arg 340 345 350 Phe Ala Ser Lys Val Asn Gln Cys Val Ile Gly Thr Ala Gln Ala Asn 355 360 365 Arg Lys 370 5 1110 DNA Human 5 atggaactca agggcaacat ccgtgtattc tgccgggtcc gccctgtcct gccgggggag 60 cccactccac cccctggcct cctcctgttt ccctctggcc ctggtgggcc ctctgatcct 120 ccaacccgcc ttagcctctc ccggtctgac gagcggcgtg ggaccctgag tggggcacca 180 gctcccccaa ctcgccatga tttttccttt gaccgggtat tcccaccagg aagtggacag 240 gatgaagtgt ttgaagagat tgccatgctt gtccagtcag ccctggatgg ctatccagta 300 tgcatctttg cctatggcca gacaggcagt ggcaagacct tcacaatgga gggtgggcct 360 gggggagacc cccagttgga ggggctgatc cctcgggccc tgcggcacct cttctctgtg 420 gctcaggagc tgagtggtca gggctggacc tacagctttg tagcaagcta cgtagagatc 480 tacaatgaga ctgtccggga cctgctggcc actggaaccc ggaagggtca agggggcgag 540 tgtgagattc gccgtgcagg gccagggagt gaggagctca ctgtcaccaa tgctcgatat 600 gtccctgtct cctgtgagaa agaagtggac gccctgcttc atctggcccg ccagaatcgg 660 gctgtggccc gcacagccca gaatgaacgg tcatcacgca gccacagtgt attccagcta 720 cagatttctg gggagcactc cagccgaggc ctgcagtgtg gggcccccct cagtcttgtg 780 gacctggccg ggagtgagcg acttgacccc ggcttagccc tcggccccgg ggagcgggaa 840 cgccttcggg aaacacaggc cattaacagc agcctgtcca cgctggggct ggttatcatg 900 gccctgagca acaaggagtc ccacgtgcct taccggaaca gcaaactgac ctacctgctg 960 cagaactctc tgggtggtag tgctaagatg ctcatgtttg tgaacatttc tccactggaa 1020 gagaacgtct ccgagtccct caactctcta cgctttgcct ccaaggtgaa ccagtgtgtt 1080 attggtactg ctcaggccaa caggaagtga 1110 6 369 PRT Human 6 Met Glu Leu Lys Gly Asn Ile Arg Val Phe Cys Arg Val Arg Pro Val 1 5 10 15 Leu Pro Gly Glu Pro Thr Pro Pro Pro Gly Leu Leu Leu Phe Pro Ser 20 25 30 Gly Pro Gly Gly Pro Ser Asp Pro Pro Thr Arg Leu Ser Leu Ser Arg 35 40 45 Ser Asp Glu Arg Arg Gly Thr Leu Ser Gly Ala Pro Ala Pro Pro Thr 50 55 60 Arg His Asp Phe Ser Phe Asp Arg Val Phe Pro Pro Gly Ser Gly Gln 65 70 75 80 Asp Glu Val Phe Glu Glu Ile Ala Met Leu Val Gln Ser Ala Leu Asp 85 90 95 Gly Tyr Pro Val Cys Ile Phe Ala Tyr Gly Gln Thr Gly Ser Gly Lys 100 105 110 Thr Phe Thr Met 
Glu Gly Gly Pro Gly Gly Asp Pro Gln Leu Glu Gly 115 120 125 Leu Ile Pro Arg Ala Leu Arg His Leu Phe Ser Val Ala Gln Glu Leu 130 135 140 Ser Gly Gln Gly Trp Thr Tyr Ser Phe Val Ala Ser Tyr Val Glu Ile 145 150 155 160 Tyr Asn Glu Thr Val Arg Asp Leu Leu Ala Thr Gly Thr Arg Lys Gly 165 170 175 Gln Gly Gly Glu Cys Glu Ile Arg Arg Ala Gly Pro Gly Ser Glu Glu 180 185 190 Leu Thr Val Thr Asn Ala Arg Tyr Val Pro Val Ser Cys Glu Lys Glu 195 200 205 Val Asp Ala Leu Leu His Leu Ala Arg Gln Asn Arg Ala Val Ala Arg 210 215 220 Thr Ala Gln Asn Glu Arg Ser Ser Arg Ser His Ser Val Phe Gln Leu 225 230 235 240 Gln Ile Ser Gly Glu His Ser Ser Arg Gly Leu Gln Cys Gly Ala Pro 245 250 255 Leu Ser Leu Val Asp Leu Ala Gly Ser Glu Arg Leu Asp Pro Gly Leu 260 265 270 Ala Leu Gly Pro Gly Glu Arg Glu Arg Leu Arg Glu Thr Gln Ala Ile 275 280 285 Asn Ser Ser Leu Ser Thr Leu Gly Leu Val Ile Met Ala Leu Ser Asn 290 295 300 Lys Glu Ser His Val Pro Tyr Arg Asn Ser Lys Leu Thr Tyr Leu Leu 305 310 315 320 Gln Asn Ser Leu Gly Gly Ser Ala Lys Met Leu Met Phe Val Asn Ile 325 330 335 Ser Pro Leu Glu Glu Asn Val Ser Glu Ser Leu Asn Ser Leu Arg Phe 340 345 350 Ala Ser Lys Val Asn Gln Cys Val Ile Gly Thr Ala Gln Ala Asn Arg 355 360 365 Lys 
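
By way of illustration only, the correspondence between the nucleotide sequences and the amino acid sequences listed above can be checked by translating a nucleotide sequence with the standard genetic code. The following Python sketch is not part of the original disclosure. The function name `translate` and the way the codon table is built are illustrative assumptions, and only the first and last thirty-odd nucleotides of SEQ ID NO:3 are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order t, c, a, g
# (first base varies slowest). '*' marks a stop codon.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Translation starts at the first base and stops at the first stop codon.
    This helper is hypothetical and exists only for this illustration.
    """
    dna = "".join(dna.lower().split())
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of SEQ ID NO:3, copied from the listing above.
seq3_start = "atgcaggaac tcaagggcaa catccgtgta"
# Last 33 nucleotides of SEQ ID NO:3 (in frame, including the stop codon).
seq3_end = "gttattggta ctgctcaggc caacaggaag tga"

print(translate(seq3_start))  # MQELKGNIRV
print(translate(seq3_end))    # VIGTAQANRK

# Length check: 370 residues plus one stop codon account for 1113 nucleotides.
print(3 * (370 + 1))          # 1113
```

The two translated fragments correspond to the first ten residues (Met Gln Glu Leu Lys Gly Asn Ile Arg Val) and the last ten residues (Val Ile Gly Thr Ala Gln Ala Asn Arg Lys) of SEQ ID NO:4 as listed.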

What is claimed is:
 1. An isolated nucleic acid sequence, wherein the nucleic acid encodes SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6.
 2. An isolated nucleic acid sequence, wherein the nucleic acid has a nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5.
 3. An expression vector comprising a nucleic acid sequence of claim 1.
 4. A host cell transfected with the vector of claim 3.
 5. An expression vector comprising a nucleic acid of claim 2.
 6. A host cell transfected with the vector of claim 5.